Web-based text anonymization with Node.js: Introducing NETANOS (Named entity-based Text Anonymization for Open Science)

Scaling Wikipedia-based Named Entity Disambiguation to Arbitrary Web Text

This paper investigates the “named-entity disambiguation” task on the Web: identifying the referent of a string found on an arbitrary Web page. The GROUNDER system, introduced in this paper, addresses two challenges not considered by previous work: how to utilize a priori information (e.g., Bill Clinton is more prominent on the Web than Clinton County) to improve disambiguation, and how to compo...

Full text

Named Entity Recognition in Persian Text using Deep Learning

Named entity recognition is a fundamental task in natural language processing and a subtask of information extraction. It aims to find proper nouns in a text and classify them into predetermined classes such as names of people, organizations, and places. In this paper, we propose a named entity recognizer which benefi...

Full text

Generalization-Based k-Anonymization

Microaggregation is an anonymization technique that partitions the data into clusters of no fewer than k elements and then replaces each cluster with its prototypical representative. Most microaggregation techniques work on numerical attributes. However, many data sets are described by heterogeneous types of data, i.e., numerical and categorical attributes. In this paper we propo...

Full text
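
The microaggregation step described in the abstract above can be illustrated for the simplest case, a single numerical attribute. The following is a minimal sketch, not the method proposed in the paper (whose abstract is truncated here): it assumes fixed-size clusters formed from sorted values and uses the mean as the cluster prototype. The function name `microaggregate` and its parameters are hypothetical.

```python
from statistics import mean


def microaggregate(values, k):
    """Replace each value with the mean of a cluster of at least k sorted neighbours.

    Assumption: one numerical attribute, clusters built from consecutive
    sorted values, mean used as the prototype.
    """
    if k < 1:
        raise ValueError("k must be at least 1")
    n = len(values)
    if n < k:
        raise ValueError("need at least k values to form one cluster")

    # Indices of the records, ordered by attribute value.
    order = sorted(range(n), key=lambda i: values[i])
    result = [0.0] * n

    start = 0
    while start < n:
        end = start + k
        # If fewer than k records would remain, fold them into this cluster
        # so that no cluster ends up smaller than k.
        if n - end < k:
            end = n
        cluster = order[start:end]
        prototype = mean(values[i] for i in cluster)
        for i in cluster:
            result[i] = prototype
        start = end
    return result


if __name__ == "__main__":
    ages = [10, 1, 2, 11, 3, 12, 13]
    print(microaggregate(ages, k=3))
```

After this step every released value is shared by at least k records, which is the property the cluster-size constraint is meant to guarantee. Categorical attributes, which the abstract identifies as the harder case, would need a different prototype than the mean.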

Authors

Abstract

Similar resources

Scaling Wikipedia-based Named Entity Disambiguation to Arbitrary Web Text

Named Entity Recognition in Persian Text using Deep Learning

Generalization-Based k-Anonymization

Journal